A dynamic semantic model for re-scoring recognition hypotheses
نویسندگان
چکیده
This paper describes the use of Belief Networks (BNs) for dynamic semantic modeling within the United Airlines’ FLight InFOrmation service (FLIFO). Callers can speak naturally to obtain status information about all flights (including arrival and departure times) of United Airlines. In this work we aim at enabling the application to utilize dynamic call information to improve speech recognition performance. Dynamic call information include the location of the caller, the time and date of the call, and the caller's dialog history. Dynamic semantic models can incorporate such additional information about the call in rescoring the N-best recognition hypotheses. Our experiments showed that this improved the recognition accuracy of flight number utterances from 84.95% to 86.80%.
منابع مشابه
Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model
In this paper, we present strategies to incorporate long context information directly during the first pass decoding and also for the second pass lattice re-scoring in speech recognition systems. Long-span language models that capture complex syntactic and/or semantic information are seldom used in the first pass of large vocabulary continuous speech recognition systems due to the prohibitive i...
متن کاملتشخیص دستنوشتۀ برخط فارسی با استفاده از مدل زبانی و کاهش قوانین نگارش کاربر
The Joint-up, cursive form of Persian words and immense variety of its scripts, also different figures of Persian letters depending on their sitting positions in the words, have turned the Persian handwritings recognition to an intense challenge. The major obstacle of the most often recognition ways, is their inattention to sentence contexture which causes utilizing of a word with correct appea...
متن کاملSearch and Decoding Strategies for Complex Lexical Modeling in Lvcsr
The language model (LM) in most state-of-the-art large vocabulary continuous speech recognition (LVCSR) systems is still the n-gram. A major reason for using such simple LMs, besides the ease of estimating them from text, is computational complexity. It is also true, however, that long-span LMs, be they due to a higher n-gram order, or because they take syntactic, semantic, discourse and other ...
متن کاملSemantic Coherence Scoring Using an Ontology
In this paper we present ONTOSCORE, a system for scoring sets of concepts on the basis of an ontology. We apply our system to the task of scoring alternative speech recognition hypotheses (SRH) in terms of their semantic coherence. We conducted an annotation experiment and showed that human annotators can reliably differentiate between semantically coherent and incoherent speech recognition hyp...
متن کاملSupport Vector Machines for Postprocessing of Speech Recognition Hypotheses
In this paper, we introduce an approach to improve the recognition performance of a Hidden Markov Model (HMM) based monophone recognizer using Support Vector Machines (SVMs). We developed and examined a method for re-scoring the HMM recognizer hypotheses by SVMs in a phoneme recognition framework. Compared to a stand-alone HMM system, an improvement of 9.2% was reached on the TIMIT database and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001